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Suppose two distant observers Alice and Bob share a pure bipartite quantum state. By applying 
local operations and communicating with each other using a classical channel, Alice and Bob can ma- 
nipulate it into some other states. Previous investigations of entanglement manipulations have been 
largely limited to a small number of strategies and their average outcomes. Here we consider a gen- 
eral entanglement manipulation strategy and go beyond the average property. For a pure entangled 
state shared between two separated persons Alice and Bob, we show that the mathematical inter- 
change symmetry of the Schmidt decomposition can be promoted into a physical symmetry between 
the actions of Alice and Bob. Consequently, the most general (multi-step two-way-communications) 
strategy of entanglement manipulation of a pure state is, in fact, equivalent to a strategy involving 
only a single (generalized) measurement by Alice followed by one-way communications of its result to 
Bob. We also prove that strategies with one-way communications are generally more powerful than 
those without communications. In summary, one-way communications is necessary and sufficient for 
the entanglement manipulations of a pure bipartite state. The supremum probability of obtaining a 
maximally entangled state (of any dimension) from an arbitrary state is determined and a strategy 
for achieving this probability is constructed explicitly. One important question is whether collective 
manipulations in quantum mechanics can greatly enhance the probability of large deviations from 
the average behavior. We answer this question in the negative by showing that, given n pairs of 
identical partly entangled pure states l^*)) with entropy of entanglement £'(|^')), the probability of 
getting nK {K > _E(|^'))) singlets out of entanglement concentration tends to zero as n tends to 
infinity. 



PACS Numbers: 03.67.Dd 

I. INTRODUCTION AND SUMMARY OF KEY 
RESULTS 

Entanglement — the non-locality of entangled state — 
as in the Einstein-Podolsky-Rosen paradox discov- 
ered by J. Bell |^ in 1964, has long been regarded a 
hallmark of quantum mechanics. In the past, entangle- 
ment was often regarded as a qualitative property of a 
state. The last few years, however, witnessed a dramatic 
change in the approach to entanglement. Entanglement 
is now regarded as an important quantitative and use- 
ful resource in achieving tasks of quantum information 
processing such as dense coding [|| , teleportation j|] and 
reduction of communication complexity 

The study of quantum information processing has been 
complicated by the fact that entanglement can appear 
in many non-standard forms. Fortunately, it is known 
that two distant parties sharing a bipartite pure state 
can apply local operations and classical communication 
to "manipulate" entanglement, thus converting one form 
to another To better understand quantum informa- 
tion processing, it is important to discover the fundmen- 
tal laws of a general entanglement manipulation. Most 



of the previous investigations have focused on some par- 
ticular types of entanglement manipulations, namely, en- 
tanglement concentration and dilution of an ensemble of 
identical states, say $7 = (a|ll) -f b\22))^ , which are col- 
lectively processed. Moreover, the main interest was in 
the average properties and little is known about the ac- 
tual probability distribution of the outcomes of those ma- 
nipulations. 

This paper concerns mainly the fundamental laws of 
entanglement manipulations of pure bipartite states. Un- 
like previous investigations, here we allow the initial state 
to be a single copy of a general state It is useful to 
note, however, that in fact all entanglement manipulation 
methods, both "single-pair" and "collective" ones can be 
reformulated as "single-pair" methods, by redefining the 
"particles" . Indeed, suppose Alice and Bob share n pairs 
of particles, and intend to process them by some collec- 
tive method. We can now regard all n particles in each 
side as a single "particle" , living in a higher dimensional 
Hilbert space (equal to the product of the Hilbert spaces 
of the original n particles). The n original pairs can thus 
be regarded a single pair of two (more complex) quantum 
particles, and the original "collective" manipulation can 
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be regarded as a "single-pair" type manipulation of this 
new pair. Consequently, all questions concerning collec- 
tive manipulations can be answered by studying "single- 
pair" manipulations of a generic state of two arbitrary 
particles. This is the path that we will follow in the 
present paper. 

Our results are the following. 

• First of all, we show that, rather surprisingly, gen- 
eral entanglement manipulations of a pure bipartite 
state with only one way communication are equally 
powerful as those with two way communication, but 
are more powerful than those with no communica- 
tion. 

• Then, we specialize in a class of entanglement ma- 
nipulations, namely entanglement concentration, 
from a general pure bipartite initial state 4" to a m- 
dimensionally maximally entangled state (which we 
shaU call to- Af £'-state) , $,„ = ^ Z)™ i N)a|«)b 
where \i)A and are orthonormal vectors in Al- 
ice's and Bob's Hilbert spaces Ha and Hb respec- 
tively and TO is fixed but arbitrary. (Notice that $i 
is a direct product, $2 is a singlet and $29 is equiv- 
alent to q singlets. Therefore, a TO-A/i?-state can 
be regarded as a generalization of singlets.) The 
main questions that we ask are: What is the opti- 
mal probability of getting a from ^P? and also: 
What is the optimal strategy that will achieve such 
a probability? In this paper, we give complete an- 
swers to those questions. 

• After that, we specialize to the entanglement ma- 
nipulation of a large number of identical pairs of 
$ and derive a bound to the probability of having 
a large deviation from the average property. More 
specifically, suppose that two remote observers, Al- 
ice and Bob, share n pairs of spin 1/2 particles, 
each pair in a non-maximally entangled pure state 
I*) = a|l)|l) + /3|2)|2). Then, by local actions 
(which may include local unitary transformations, 
measurements and attachment of ancillary quan- 
tum systems) and classical communications Alice 
and Bob can convert these pairs into a (smaller) 
number m of perfect singlets. It has been shown 
[H that in the limit of large n, Alice and Bob 
can perform a reversible conversion of the n pairs 

into singlets, obtaining, on average a number 
TO = n_E(^) of singlets. (Here, i?(^) is the "en- 
tropy of entanglement" of a pure bipartite state 
consisting of subsystems A and B and is defined to 
be the von Ncumman entropy of subsystem A (or 
B) Q.) Furthermore, as a consequence of this re- 
versibility property, together with the fact that on 
average entanglement cannot increase via local ac- 
tions and classical communications [^,|^, this par- 
ticular entanglement manipulation method yields 



the maximal possible average number of singlets. 
It has been shown |^ that -E(^'), the maximal av- 
erage number of singlets which can be extracted 
per original pair 4*, is up to a constant multiplica- 
tive factor, the unique measure of entanglement for 
4* that is additive and non-increasing under local 
operations and classical communication. 

However, in all previous investigations of entangle- 
ment manipulations, the focus was on the average 
values, such as on the question "What is the aver- 
age number of singlets which can be extracted from 
n pairs 5"?" Indeed, the whole idea of reversibility 
refers only to the average properties. The point is 
that, as in all asymptotic results, there is always 
a very small but non-zero probability for an en- 
tanglement concentration procedure to give a sub- 
stantially smaller amount of singlets than the ex- 
pected number. Here we want to go beyond aver- 
age values and ask about the actual distributions. 
For example, the same average number of singlets, 
TO ~ nEl'^) might, in principle, be obtained from 
very different distributions: In the reversible pro- 
cedure described in Ref. |6|, out of n pairs ^E" a 
number m of singlets is obtained with some proba- 
bility Pm, and the distribution is essentially Gaus- 
sian, peaked around to — n£'(4'). In particular, 
via this procedure the probability to obtain a large 
number of singlets, m « n is exponential small. 
However, one could envisage a distribution which 
yields the same average to = nE{'i') while having 
a non-negligible probability for obtaining a large 
number of singlets — for example, a distribution in 
which the probability of obtaining m = n singlets 
is E{'^) while in all other cases zero singlets are 
obtained. The question is "Does there exist any 
entanglement manipulation procedure which real- 
izes the later distribution?" 

A main point of our investigation is to gain a better 
understanding of the collective properties involved 
in entanglement manipulation. Indeed, if Alice and 
Bob would extract singlets by processing each of the 
n pairs 'J separately, the law of large numbers tells 
that the probability distribution of the number of 
singlets will (asymptotically) be Gaussian. Devia- 
tions from this distribution can be obtained (if at 
all) only if Alice and Bob process all the n pairs 
together. But are such deviations possible? And if 
so, how big can they be? 

[To put things in the right perspective, we would 
like to mention that the reversible procedure j|] dis- 
cussed above, is not a procedure in which each pair 
is processed separately but a collective one — yet, 
the distribution it yields is essentially Gaussian.] 

In this paper, we show that, in the context of en- 
tanglement manipulations of a large ensemble of 
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identical pure bipartite states, collective manipula- 
tions cannot substantially enhance the probability 
of large deviations from the average properties. 

• Afterwards, we consider a subclass of entanglement 
manipulation strategies in which the final state is 
always one of the possible $m's. (Say, the state $i 
appears with probability pi , $2 with probability p2 
and, $„ with probability p„, etc) We define a 
natural notion of a universal strategy for entangle- 
ment manipulations for all values of m and show 
that such a universal strategy cannot possibly ex- 
ist. 

• Finally, we present open questions in the entan- 
glement manipulations of pure states and discuss 
briefly problems in generalizing our results to mixed 
states. 

In summary, in a conceptual level, the novelties of our 
investigation are the following. 

1) Some of our results — particularly the statement that 
manipulations with one way communication are equally 
powerful as those with two way communication — apply 
to a general entanglement manipulation strategy. In con- 
trast, all previous investigations have focused on either 
entanglement concentration or dilution. 

Incidentally, we clearly demonstrate the important role 
of symmetry in entanglement manipulations. Indeed, in 
our proof that manipulation strategies of a pure bipar- 
tite state with only one way communication are equally 
powerful as those with two way communication, we are 
essentially promoting the mathematical interchange sym- 
metry of the Schmidt decomposition into a physical sym- 
metry between the local actions of Alice and Bob. (See 
section |l|) 

2) We allow the initial state to be a general In con- 
trast, almost all previous investigations have restricted 
$ to be identical copies of some state, say u = 
a|ll) +fe|22), and have been mainly concerned with prop- 
erties in the large A^ limit. 

3) We are concerned with the role of probability rather 
than the average property in entanglement manipula- 
tions. 

II. ONE-WAY COMMUNICATIONS IS 
NECESSARY AND SUFFICIENT FOR 
ENTANGLEMENT MANIPULATIONS OF 
BIPARTITE PURE STATES 

A. Reduction from two-way to one-way 
communications 

The most general scheme of entanglement manipula- 
tions of a bipartite pure entangled state involves two-way 



communications between Alice and Bob. It goes as fol- 
lows: Alice performs a measurement and tells Bob the 
outcome. Bob then performs a measurement (the type 
of measurement that Bob performs can depend on Al- 
ice's measurement outcome) and tells Alice the outcome, 
etc, etc. The goal of this subsection is to prove that any 
strategy of entanglement manipulation of a pure bipartite 
state is equivalent to a strategy involving only a single 
(generalized) measurement by Alice followed by the one- 
way communications of the result from Alice to Bob (and 
finally local unitary transformations by Alice and Bob). 

Here we introduce some definition. 

Definition 1: (Ordered Schmidt coefficients) An ar- 
bitrary pure state ^' can be written in Schmidt decom- 
position 

N 

*-^V^|a,)|&,), (1) 

i 

where {ai\aj) ~ = ^ij- We call \f\i^ the ordered 

Schmidt coefficients if the A^'s are ordered decreasingly, 
i.e., Ai > A2 > • • • > Aat. Note that all phases have been 
absorbed in the definition of the states |ai)'s so that the 
Ai's are positive real numbers. 

First of all, since it is more convenient to deal with pro- 
jection operators than positive operator valued measures 
(POVMs), we include any ancilla (measuring apparatus) 
in Alice and Bob's quantum systems. Therefore, with- 
out loss of generality, we regard Alice and Bob as shar- 
ing a pair of particles with an infinite (or an arbitrarily 
large) dimensional Hilbert space but initially only N of 
the coefficients of the Schmidt decomposition ||] are non- 
zero, i.e., 1*) = X]i=i V>^i\ai)\bi) where {ai\aj) = and 
{bi\bj) = Sij. We further assume that the above form of 
the Schmidt decomposition of j^I') is known to Alice and 
Bob. 

Second, we consider only the most advantageous en- 
tanglement manipulation scheme in each step of which 
Alice keeps track of the results of all her measurements 
and tells Bob about them and vice versa. Alice and Bob 
then update their information on the state they share in 
each step. Since it is a pure state j^*) that Alice and Bob 
start with, they always deal with a pure state in each step. 
Any scheme in which Alice and Bob choose to be sloppy 
or ignorant can be re-casted as a situation in which they 
fail to make full use of their information. Therefore, there 
is no loss in generality in our consideration. |l0|] 

We now argue that any two-way entanglement manip- 
ulation strategy for the state \'^) can be re-casted into 
an equivalent strategy which involves only one-way com- 
munications from Alice to Bob — that is to say a strat- 
egy in which Alice performs all the measurements and 
informs Bob of the outcomes afterwards. This is so be- 
cause (i) in entanglement manipulations we are mainly 
concerned with the coefficients of the Schmidt decom- 
position and (ii) in each step of entanglement manipu- 
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lation, the Schmidt decomposition of the pure state in- 
volved is always symmetric under the interchange of Alice 
and Bob. With such symmetry, there is no advantage in 
having Bob perform the measurement instead of Alice 

More concretely, consider a round of communica- 
tions in a two-way scheme of entanglement manipula- 
tion. Suppose Alice has performed a measurement on 
I'f) = J2k \/^Wk)\K) obtained an outcome oi. 
She can work out the Schmidt decomposition PoJ^') = 
Sfe \/^kfc)l^fc) of the state that she now shares with 
Bob. Now Alice is supposed to tell Bob the outcome oi of 
her measurement and Bob then will perform a measure- 
ment with a set of local projection operators say {P;^°^}. 
However, it turns out that there exists a set of local pro- 
jection operators {p^^'»c'=} by Alice which will do essen- 
tially the same trick, as far as entanglement manipula- 
tion is concerned. Mathematically, we claim the following 
Proposition. 

Proposition 1: Given any pure bipartite state 
shared by Alice and Bob and any complete set of projec- 
tion operators {P;^°''}'s by Bob, there exists a complete 
set of projection operators {P;'^'*^'^}'s by Alice and, for 
each outcome I, a direct product of local unitary trans- 
formations Uf^ (8) up such that, for each I, 

{I (g) PP""") |*> = {up ® UP) (P,^''^° ® /) I*) . (2) 

Idea of the Proof: Consider a pure bipartite state in its 
Schmidt decomposition 

l*> =EVA|e»)A|e.)s (3) 

i 

shared between Alice and Bob. Notice that, by the defini- 
tion of the Schmidt decomposition, it is symmetric under 
the interchange of \ei)A and |ei)s. This is a mathemat- 
ical symmetry. Now, in Proposition 1, we promote this 
mathematical symmetry into a physical symmetry be- 
tween the actions of Alice and Bob in the context of en- 
tanglement manipulations. More precisely, if Bob applies 
a set of projection operators {PP°^} on his system and 
obtains an outcome /, the state \'$) will be transformed 
into some state, say, 

\^'')=J2^^\<)a\K)b- (4) 

i 

On the other hand, if Alice applies a corresponding set 
of projector operators {p^-^.hcej ^^j. syg^g^i (instead 
of Bob) , then we show that the corresponding outcome I 
will give her a state 

I*-'>-EVmIK')^IOb (5) 

i 

with exactly the same Schmidt coefficients as \ '^^)- Con- 
sequently, there exists a bi-local unitary transformation 



that will rotate the state |\E'"*) to l^*^). In this sense, the 
states 1^''^) and l^*^) are equivalent. The upshot is that 
there is no advantage for Bob to perform a measurement, 
in the place of Alice. In summary, as far as entanglement 
manipulation of a pure bipartite state is concerned, there 
is a total symmetry between the actions of Alice and Bob 

Proof: See Appendix [Aj. 

One can repeat the above argument and prove that all 
the rounds of measurements can be performed by Alice 
alone and Alice only needs to tell Bob her outcomes after 
the completion of all her measurements. 

What it means is that, for Alice and Bob manipulating 
a pure bipartite state, one can, without loss of generality, 
restrict oneself to schemes of entanglement manipulations 
using only one-way communications from Alice to Bob. 

Finally, it is a well-known consequence of measurement 
theory that the entire sequence of Alice's measurements 
can be described as a single generalized measurement. 
[One may argue this well-known result as follows. Ev- 
ery measurement consists of two steps — the interaction 
of a measuring devise with a system, and the "reading" 
of the measuring device, i.e. a unitary transformation 
and a projection. Now, any arbitrary sequence of inde- 
pendent measurements can be replaced by an equivalent 
single measurement, by simply letting all the interactions 
to be performed first, and reading all the measuring de- 
vices simultaneously at the end. In this case one can 
view all the independent measuring devices as a (more 
complicated) single measuring device, performing a sin- 
gle interaction with the measured system (the unitary 
transformation describing this interaction being simply 
the product of the unitary transformations describing the 
individual measuring devices) and followed by a single 
reading stage. Furthermore, even if the measurements 
are not independent from each other, i.e., some measure- 
ments depend on the results of previous measurements, 
we can still replace the sequence by a single measure- 
ment: In this case too the human observer can postpone 
"reading" the results obtained by the different measuring 
devices until the end. Indeed, there is no need for the ob- 
server to read the results of the measurements in order to 
tune the subsequent measurements accordingly. The en- 
tire process can be realized by the measuring devices in- 
teracting with each other as well as with the system under 
observation. Then, once again, we have a single measur- 
ing device, performing a single interaction, (only that the 
interactions between the measuring device and the sys- 
tem contain also some internal interactions between the 
different parts of the measuring device — corresponding 
to one part reading the result of the other), and a single 
reading stage.] 

In summary, the most general strategy of entanglement 
manipulation of a pure bipartite state is equivalent to a 
strategy involving only a single (generalized) measure- 
ment performed by Alice followed by the one-way com- 
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munications of the result from Alice to Bob (and finally 
local unitary transformations by Alice and Bob). 



B. One-way communications are provably better 
than no communications 



We have shown above that two-way communications 
are not necessary for the entanglement manipulation of 
a pure bipartite state — the most general entanglement 
manipulation strategy can be realized with only one-way 
communication. A natural question to ask is whether 
communication is needed at all. We show that, indeed, 
communication is necessary. That is to say that entan- 
glement manipulation strategies without communication 
cannot achieve all that could be achieved with commu- 
nication. The proof of relegated to Appendix |^. 

In conclusion, one-way communications generally give 
more powerful strategies than those without communica- 
tions. On the other hand, we proved in the above para- 
graphs that one-way communications is sufficient for any 
strategy. Combining these two results, we conclude that 
one-way communications is necessary and sufficient for 
implementing a general strategy of entanglement manip- 
ulations of pure bipartite states. 



III. OBTAINING A GIVEN MAXIMALLY 
ENTANGLED STATE <!>m FROM AN 
ARBITRARY STATE * 

Definition 2: (ni-ME-state: $m) We shall denote by 
$m a standard m- dimensional maximally entangled state 



1$. 



— T 



(6) 



where |i)A's (|i)B's respectively) form an orthonormal 
basis for a Hilbert space Ha {Hb respectively). In par- 
ticular, $1 is a direct-product, $2 is (equivalent to) a 
singlet and ^^21 is equivalent to q singlet pairs. In what 
follows, we shall call an m-ME-state. 

We now come to one of the main results of our paper. 
We consider the following particular problem. Suppose 
Alice and Bob share a pair of particles in some arbitrary 
pure state 4". By different entanglement manipulations 
strategies we can transform ^ into a given m-dimensional 
maximally entangled state $m ■ In general such a process 
does not succeed with certainty but only with some prob- 
ability Pm- Here we enquire what is the maximal proba- 
bility with which such a transformation could occur. 

Incidentally, one can even obtain a maximally entan- 
gled state whose degree of entanglement is greater than 
that of the initial state. Since the average degree of 
entanglement cannot increase, it is obvious that such a 
transformation always occurs with probability less than 



1 . To describe such a situation we sometime use the term 
"gambling with entanglement" - indeed, Alice and Bob 
try to achieve a better than average outcome while tak- 
ing the rise of loosing entanglement if the result turns 
out to be unfavorable. 

Definition 3: {Pm^^) For ^^y positive integer to, we 
define p^"^^ to be the supremum over all manipula- 
tion strategies of the probability Pm of getting an m-state 
from a pair initially in the state 

We will prove the following theorem: Main Theo- 
rem (Theorem 0) : If we write the initial state ^' in or- 
dered Schmidt decomposition as ^' = J2iLi y^\0'i)\bi), 
the supremum probability p^^'^ of obtaining $,„ over all 
possible entanglement manipulation strategies is given by 
the following. 

i) If TO > iV (N being the number of terms in the 
Schmidt decomposition of ^P), then p^^^ = 0. 

ii) If TO < A^, then 



MAX 



mini<:r<m — (A„ 

- - r 



-1 + A„i-r+2 + 



+ Aat). 

(7) 



The proof of this main theorem is divided into two 
parts. In the next section (section IV), we will derive an 
upper bound on p^^'^ (see Theorem 1). In section 0, 
we demonstrate an explicit strategy that saturates the 
bound and is, thus, optimal. (See Theorem 2.) 



IV. UPPER BOUND ON Pt 



THEOREM 1 



Theorem 1: If we write the initial state 'i> in or- 
dered Schmidt decomposition as = J^iLi V\\<^i)\bi) , 
the supremum probability pf^l^^ of obtaining over 
all possible entanglement manipulation strategies satis- 
fies the following. 

i) If m > A^ (N being the number of terms in the 
Schmidt decomposition of ^P), then p^^^ = 0. 

u) If TO < A^ , then 



MAX 



< miriKrKm — (A„ 
- - J. 



-1 + A„i_,-+2 + 



+ Aat). 
(8) 



A. The number of Schmidt decomposition terms can 
never increase: Part i) of Theorem 1 

The following Lemma is useful. 

Lemma 1: The number of terms in a Schmidt de- 
composition can never increase under local measurements 
and classical communications [|l9| . 

Proof: Let us suppose that the initial state 1$) = 
'^iLi \/Ai|'^j:) l^i) has only A^ non-vanishing terms in its 
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Schmidt decomposition. For each measurement outcome 



the resulting state P; 



Alice 



Alice 



N 



I on |<i>) 

[where |a-) is the projected state P;'^''^^|ai)] can be ex- 
pressed as a sum of N terms. Consequently, its Schmidt 
decomposition must have at most N terms. QED. 

Proof of Part i) of Theorem 1: As a corollary 
of Lemma 1, for an initial state |<I>) — J2iLi VKWi)\bi) 
with only N non-vanishing terms in its Schmidt decom- 
position, p^^^ = 0, if m > TV. QED. 

This leads to the following apparent paradox. Sup- 
pose Alice and Bob share s standard singlets. What is 
the probability that they can gamble successfully and get 
S (> s) singlets? Naively, one might expect the probabil- 
ity to be non-zero: One may use quantum data dilution 
1^ to dilute s standard singlets into say 5" pairs of |<I>) 
each of entanglement £^(1$)) = s/S and then apply the 
Procrustean (i.e., local filtering) method |^ of entangle- 
ment gambling to each of S pairs of 1$). For each |$), 
the Procrustean method gives a non-zero probability say, 
p', of getting a maximally entangled pair out of it. So, 
it looks as if there would be a non-zero probability {p')^ 
of getting S singlets from s singlets. But, as we have 
seen above this argument is erroneous - the probability 
of getting S singlets out of gambling with s singlets is 
strictly zero. The reason is that quantum data dilution 
is an inexact process which is valid only on average. 



B. An Upper Bound on p"^^: Part ii) of Theorem 1 

It remains to prove Part ii) of Theorem 1. It is conve- 
nient to introduce the following notation. 

Notation: (P™) We denote the r-th bound in Theo- 
rem 1 by B™. i.e.. 



— ('^m-r+1 + Xm-r+2 

r 



+ Aat). 



(9) 



Restatement of Part ii) of Theorem 1: Given a 
state 1 4*) with the ordered Schmidt decomposition j^I') ~ 
\f^i\o-i)\i>i) ^ tbe supremum probability p^'^'^ of ob- 
taining an m-ME-state out of manipulating Y^) satisfies 
a set of constraints p^^'^ < P™ for 1 < r < m. 

Idea of the proof of Part ii) of Theorem 1: For a 
fixed r, if the right hand side, P™, is zero, then there 
are only m — r terms in the Schmidt decomposition of 
l^*). From Lemma 1, Alice will definitely fail to get an 
m-diniensional maximally entangled pair state because 
there will be at most m — r terms in the Schmidt decom- 
position of the resulting state. We will turn this argu- 
ment around to show the following. If Alice does succeed, 
the remaining r (i.e., from m — r -I- 1-th to m-th) terms in 
the maximally entangled state must have come from the 
remaining (i.e., from m — r + 1-th to 7V-th) terms of the 
Schmidt decomposition of the original state |<I>). [Sur- 
prisingly, classical reasoning is, in fact, valid here. This 



is because when one considers the reduced density matrix 
of Alice, Bob's system provides a "record" for its history. 
Therefore, no interference effect is possible. See below.] 
Let us multiply both sides of the inequality and consider 
the following new inequality: rp^^^ /m < rB"^/m. Now 
the left hand side of the new inequality is simply the prob- 
ability that Alice's state gets projected into the remain- 
ing r terms. [There is a supremum probability p^^'^ 
of getting successfully an m-dimensional maximally en- 
tangled state and a conditional probability r/moi get- 
ting projected in an r-dimensional subspace of the m- 
dimensional space in the support of Alice's system.] It 
must therefore be constrained by the probability of Bob's 
system getting projected into the space spanned by the 
TO — r -f 1-th to A^-th terms in |$), which is given by the 
right hand side. 

Proof of Part ii) of Theorem 1: Given an initial 
state 1$), for 1 < r < m, we decompose |$) = I*!"'!') + 1*2) 
where \<^\) = E"rVA7|a,) |&^) [Define = 0.] and 

= Et,n-r+iVA7l«»>l^*>- [Define |$^) = when- 
ever N < m — r + 1.] Alice and Bob now attempt to 
manipulate |$) into an m-ME-state. Alice can divide up 
the outcomes into two sets: {si, S2, • • • , Sp} (success) and 
{/ii /2, • • • , fq} (failure). Let us consider a successful out- 
come si. Then P,, |$) = P^, |$^) + P^, |$^) is an m-ME- 
state. Denoting by (similarly p^^^' where i = 1 or 2) 
the un-normalized density matrix Tr^Ps, |$)($|Pj^j (sim- 
ilarly TrBPsJ(f>[)($[|Pj' where i = 1 or 2 respectively), 

1 S! r,sj , r,si 

we have pX = Pa,i+ Pa,2- 

We emphasize that the interference term arising from 
TrBPsJ$i)($2|Pl is identically zero. This is because, 
when one considers the reduced density matrix of Alice, 
Bob's system provides a "record" for its history. In tak- 
ing the partial trace over Bob's system, all the interfer- 
ence terms disappear. It is very interesting that classical 
intuition is valid here. This greatly simplifies our discus- 
sion. 

The supports satisfy supp{p^J^\) C supp{p'X). Since 
supp{p^X'^l) has dimension at most m—r and yet supp(p^2) 
has dimension to (Pg, 1$) is an m-ME-state.), we can pick 
r orthonormal vectors \ul'),\u2), ■ ■ ■ ,\up) in supp{p^2) 
such that {ul'\v) = for all \v) G supp{p^^\). Let us de- 
fine the projection operator P^^^s, = From 
its definition, it is clear that P^«, P^aiP}^h — 0- For a fixed 
but arbitrary strategy of entanglement concentration, let 
us denote by pj^^ the probability of successfully getting 
an m-ME-state. Therefore, 



rpZ'/m 



Tr. 
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\ Si , 
= Am-r+l + Am-r+2 + ' ' ' + Xn 



(10) 



for 1 < r < m. The equality sign in the second line holds 
because is proportional to the identity matrix in a 
TO-dimensional space and its trace is proportional to its 
probability of occurring. Since the total probability of 
success is pj^^ and P^s, projects an m-ME-state into an 
r-dimensional subspace of the m-dimensional space, the 
probability of this occurring is clearly rp^'' /m. 

Now, one takes the supremum over all entanglement 
manipulation strategies in Eq. ( p^ to find that p^^^ 1^ 
f {Kn-r+i + Xn.-r+2 + • • • + Aat ) = for 1 < r < m. 
QED. 



V. OPTIMAL STRATEGY AND VALUE OF 



M 



THEOREM 2 



Theorem 1 gives an upper bound to the probability 
Pm^'^ ■ We now prove that an optimal strategy actually 
saturates this bound. In other words, we have: 

Theorem 2: Given a state Y^) = X^iLi V^loi) l^i) 
(where Ai > A2 > • • • > Ajv) with only N non- vanishing 
terms in its Schmidt decomposition. There exists a way 
to convert ^E* into an m-dimensional maximally entan- 
gled state with probability min,.g{i 2.- -.m}7^(Am-r+i + 
\m-r+2 H 1- Aat) = min,._B;". 

Proof of Theorem 2: Let us separate the proof into 
two cases: (a) minr-B™ = 1 and (b) miur-B™ < 1. 



A. Case (a) of Theorem 2 

Case (a): Let min^-B™ = 1. We shall prove that for 
an optimal strategy, the probability of getting an m-ME- 
state is 1. 

It is convenient to start with a simple case, namely 
transforming maximally entangled states into maximally 
entangled states of lower dimension. We will prove the 
following. 

Lemma 2: There is a way of transforming with prob- 
ability 1 any maximally entangled state into a maxi- 
mally entangled state of lower dimension. Consequently, 

pMAX < pMAX if ^ > s > 1. 

Proof: First, consider the case r = 3 and s = 2. (Here 
we omit the obvious normalization factors.) A maximally 
three-dimensionally entangled state has the Schmidt de- 
composition |u)Ai3 = |l)A|l)i3 + |2)A|2)B-H|3)A|3)B. We 



now show that it can be reduced with certainty to a stan- 
dard singlet |1)a|1)b -I- |2)yi|2)B. Suppose Alice prepares 
an ancilla in the state |0)a and evolves the system in 

such a way that \Q)a\l)A ^ (|2)a + |3)a)|l)A, |0)a|2)A ^ 
(|l)a + |3)a)|2)A, and \0)a\i)A ^ (|l)a + |2)a)|3)A. The 
entire state will evolve as follows: 

|0)q|u)ab 

= \Q)a{\l)A\l)B + \2)A\2)B + \i)A\i)B) 

^ \2n)aAB + \in)aAB + aAB + |322)aAB 

+ |133)aAB+ |233)aAB 
= \l)a{\22)AB + |33)ab) + \2)a{\n) AB + 133)^5) 

+ |3)a(|ll>AB + |22)AB). (11) 

Now Alice measures the state of her ancilla and ob- 
tains a singlet shared with Bob. The exact singlet which 
is obtained depends on the result of Alice's measurement, 
but it can always be transformed into the standard one 
"^(|11)ab + \22) ab)- This can be realized by Alice com- 
municating to Bob the result of her measurement, such 
that both of them know which singlet has been obtained 
and then having both of them perform the appropriate 
unitary rotations. 

A similar proof can be constructed to show that, start- 
ing with a fc-state (a maximally entangled pair of fc-state 
particles), Alice and Bob can with probability 1 convert 
it to a (fc— l)-state (maximally entangled pair of (fc — 1)- 
state particles). See Appendix ^for details. QED. 

We remark that using Lemma 2 one can convert with 
probability 1 a maximally entangled state of dimension 
i into r standard singlets provided that i > 2''. Just 
note that, as mentioned before, r standard singlets are 
equivalent to a single 2''-dimensional maximally entan- 
gled state, and use the above lemma. This simplifies a 
related discussion made in Ref. |^ and raises the proba- 
bility of success from about 1 — e to 1. 

Now we turn to the general case. The first thing to 
notice is that the condition min^i?™ = 1 is completely 
equivalent with the constraint that the largest normalized 
Schmidt coefficient is smaller or equal to 1/m. This is 
because of the following. 



Am-r < ■ • • < Ai < 1/m 



implies that 



Ai + A2 + h Xm-r < —{rn- r). 

m 



Since Ai + A2 



Xn = 1, we find that 



1 r 

Xm-r+l H h Aat > 1 (m - r) — , 

m m 

which is equivalent to 



-B™ — — (Am-r-l-l + ■ • • + Xn) > 1- 

r 



(12) 



(13) 



(14) 



(15) 
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Recall also that B,^ = 1. Therefore, we conclude that, if 
A < 1/m, then minr-B™ = 1. Conversely, if Ai > 1/m, 
Bm-i < 1- Combining these two results, we see that 
min^B™ = 1 iff Ai < 1/m. 

Idea of the Proof of Case (a) of Theorem 2: Naively, 
one might proceed by extracting a m-ME-state from 
^! iteratively. At each step, the state ^' = *i + *2 
such that '^'i is an (unnormalized) m-ME-state and 
is residual state that, when properly normalized, still 
satisfies min^-B™ = 1- One simple way to ensure that 
min^i?™ = 1 (or Ai < 1/m) is always satisfied by (if 
properly normalized) is to allow only the first m Schmidt 
terms to contribute to ^[ and, therefore, the Ai term of 
^2 decreases fast enough. 

However, this does not quite work as an iterative pro- 
cedure. The reason is that, at some point of such a pro- 
cedure, the TO-th Schmidt coefficient of the state ^ will 
become degenerate with the m + 1-th and possibly other 
coefficients. In other words, A™ — Xm+ii etc. Dealing 
with this problem is one of the major technicalities in 
the proof. Let us start by making the following defini- 
tion. 

Definition 5: (precursor state) Consider a state of 
the form 



i) obtain a precursor state which, as shown in 
Lemma 3, can readily be reduced with probability 1 to 
an m-dimensional maximally entangled state; or 

ii) obtain a residual state whose (m-th) Schmidt de- 
generacy number is increased by 1, while still obeying 
the relation min^i?™ = 1 when properly normalized. 

If Alice and Bob obtain an m-ME-state, they have ac- 
complished their task. If they get a residual state, they 
repeat the procedure. Since with each step the residual 
state increases its degeneracy number by 1, we are cer- 
tain that in a finite number of steps (< N) either Alice 
and Bob obtain an m-ME-state, or end up with a resid- 
ual state $7v, which, by Lemma 2, can subsequently be 
converted with certainty to an 

We now describe each step in more detail. Suppose the 
initial state in ordered Schmidt decomposition is 



N 



(17) 



Suppose further that Am is (p -I- g')-fold degenerate such 
that 



A 



■m—p-j-l 



Xr 



X 



m+q- 



(18) 



I pre / 



1 



J — 1 J— m— p+l 



(16) 

where p > and g > 0. Let us call it a precursor state 
of an m-ME-state. 

Remark: Note that the case q — corresponds to an 
m-ME-state. For g > 0, a precursor is a coherent sum 
of an (m — p)-ME-state and an (p -I- q)-ME-state. The 
factor i^^y^^ in the definition of |\I'™^p^'?) is needed for 
the following important result. 

Lemma 3: A precursor state of an m-ME-state can 
be converted with certainty an m-ME-statc. 

Proof of Lemma 3: The proof is essentially a gener- 
alization of the proof of Lemma 2. See Appendix 

In our proof, it is convenient to make use of the fol- 
lowing definition. 

Definition 6: (m-th Schmidt degeneracy number) For 
any pure bipartite state 5* in an ordered Schmidt decom- 
position l^?) = X^i^i VXili) A\i) B , let us define the m-th 
(m < N) Schmidt degeneracy number (or simply the de- 
generacy number when there is no ambiguity) to be the 
number of Schmidt coefficients that are degenerate with 

Am- 

Proof of Case a) of Theorem 2: Consider the 
entanglement manipulation of a general state j^*) = 
Si=iV^K)K) satisfying min^i?™ = 1. We construct 
a multi-step procedure such that in each step Alice and 
Bob either: 



The decomposition of | ^f) into a precursor and a resid- 
ual state is done by the attachment of an ancilla prepared 
in the state |0)a and a subsequent measurement by Alice. 
For 1 < i < m — p, the evolution goes as: 



\\0)a\i)A 



-|l),|z)^ + A--|0),|z)a, 
m y m 



(19) 



where |0)a and \l)a are orthonormal. For m — p -I- 1 < 
i < m + q, it goes as: 

A^|0)a|«)A 



{-)i-^)\l)a\^)A + A: - (-)(^)|0).|z)^. (20) 
m p + q y m p + q 



For m + q + l<i<N, the state is unchanged, i.e., 

\0)a\i)A ^ \0)a\i)A- (21) 

Hence, we find that 



|0)a|*) 

v/^|l)a|vI/™f^>+x/T^|0)a|*. 



(22) 



where 



m+q 



= 1 i=m-p+l ' y 



(23) 
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is the precursor and 



|*rcs> = (1 - a)-'^' 



m-\-q 



i—m—p-\-l * 



TV 



(24) 



is the residual state and a is the minimal value 
needed for a new degeneracy to occur in Schmidt 
coefficients of the residual state IvPros)- i-e., a = 



mjp+q) 



(A. 



A 



m+q+l. 



thus achieving either (1) X'^^^ = Ki-p+i or (2) A;„+g = 

Now Alice measures the state of the ancilla. If the 
outcome is "1" , she gets a precursor state which can be 
converted with certainty to an m-ME-state. If the out- 
come is "0" , she gets a residual state with its degeneracy 
number increased by 1. 

It is also easy to see that, just like the original state ^E", 
the intermediate residual state l^'ios) also has the prop- 
erty that min^i?™ = 1. The final residual state will be 
totally degenerate and, hence, has the form $7v- This 
multi-step method establishes our proof. QED. 



B. Properties of 

1. Lemma 4 

Before moving to Case (b), let us prove some lemmas. 
For any initial state |^), the bounds in theorem 1, i?™ = 
^(Kn-r+i + \n-r+2 H ^ Ajy), obcy the foUowiug. 

Lemma 4:: If B^J^^ > B™, then B;^2 > Bl"+i- 

Remark: In other words, for a fixed m, consider B,™ as 
a function of r. Once it starts to increase, it will continue 
to do so. 

Proof: See Appendix |^. 



2. Lemma 5 

By adding the condition (which is valid for case (b) of 
Theorem 2) that min^-B™ < 1, the following Lemma can 
be proven. 

Lemma 5: Given miurS 
ri such that Sf' > B^" > 

Proof: See Appendix 

Remark: Since B™ is defined to be ^{Xm-r+i + 
Xm-r+2 + • • • + Aat), in terms of A^'s, the conditions that 



< 1, there exists a unique 
• > SI? < 



< 



BY" > > ■■■ > Bl!l < B^^^^ <■■■ <Bl 
written as the following set of equations: 



Am-i < A„j + A„i-|_i + • • • + Aat 

Am-2 1^ '7^{^m-l + Am + • • • + Aat) 
... < . . . 

1 



1 can be 



ri + l 



< 



(^1 ~ 1) 



(A 



m— ri +2 



+ A, 



■rn— ri +3 



Aat) 



1 



ri > 7 ^{Xm-ri+1 + Xm-rx+2 



■■■ > 

Ai > 



(m-1) 



(A2 + A3 + ----I-Ajv). 



(25) 



Inspired by the above discussion, let us consider the 
following set of equations. 

Am-l < Am + Am+i + ■ ' ■ + Aat 
Am-2 < 2(^™-i + + • • • + Aat) 

... < . . . 

Am-r < — (Am-r + 1 + Am-r+2 + ' • • + Aat) 

r 

... < . . . 



Ai < 



1 



-(A2 + A3-H--- + AAr). 



(26) 



(to - 1) 

Consider putting Am-i, Am-2, ■ • • , Ai into the left hand 



side of Eqs. (26) one by one, we find from Eqs. ( |25| ) 
that Am-i, Am-2, • • • , Am-n-i-i Satisfy Eqs. (pq) whereas 



An 



m—r-i , Xjfi—ri —1 , 



■ , Ai violate Eqs. (Eq). Let us focus 



on the point of first violation, namely Am-ri • We notice 
that the maximal value of Am'^n that will still satisfy 
Eq. (pi) is 



\ max 
^rri—ri 



1 

ri 
B. 



(Am-ri+i + ■ ■ • + Aat) 



n_ 

TO 

min^S;" 



(27) 



With lemmas 4 and 5 proven, we now return to the 
proof of case (b) of theorem 2. 



C. Case (b) of theorem 2 

Case (b): min^S;:' < 1. 

Idea of our proof: We construct an explicit strategy 
which saturates the bound Pm = min,._B™ as follows. By 
attaching an ancilla prepared in the state |0)a to the sys- 
tem \^), Alice divides up l^*) into two pieces — successful 
and failing pieces — by the following evolution: 



\0)a\^) = \l)a\^s) + \Q)a\^f) 



(28) 
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where |0)a and \l)a are orthonormal states of the ancilla, 
|\l/s) (when properly normalized belongs to case (a), i.e., 
minr-B™ = 1 and hence) gives a probability 1 of success 
and 1 4"/) (has less than m terms in its Schmidt decompo- 
sition and hence) gives a probability of success. Alice 
now reads off the state of the ancilla. A state |l)a indi- 
cates a success and |0)a a failure. One can then read off 
the probability of success of this explicit strategy from 
the norm of \^s)- It turns out to be equal to min^-BJ". 

Proof of case (b) of Theorem 2: Recall from 
Eq. (^) that the maximal acceptable value of the {m — 
ri)-th Schmidt coefficient for it to satisfy Eq. (p6|) is 



A,, 



B 



~ min^i?!: 
m 



(29) 



Now the successful piece \"^s) in Eq. (28) is ob- 
tained by trimming the redundant contribution to 
Ai, A2, • • • , Am-n from This is done by the attach- 
ment of an ancilla prepared in the state \0)a- The evolu- 
tion goes as follows: 



+ ^J>^^-^T-r^\0)a\^)A (30) 

for 1 < i < TO — ri. For to — ri + l<i<A^, the evolution 
is 



\i\l)a\i)A- 



(31) 



Alice now reads off the state of her ancilla. We shall 
argue in the following paragraph that an outcome "0" 
means that Alice has failed in getting an TO-ME-state 
whereas an outcome "1" means that she has succeeded 
in obtaining a state satisfying minj.i?™ = 1, which by 
Sec. VI B can be reduced with certainty to an m-ME- 
state. 

If the outcome is "0", the resulting (failing) state /) 
has unnormalized Schmidt coefficients Ai — X'^'^n j ~ 



\ max 
^m—r 



\ max n 
^m—ri ) ^) ' 



,0. 



Since it has at 



most TO — ri terms in its Schmidt decomposition, it 
follows from Lemma 1 that it gives a zero probabil- 
ity of getting a TO-ME-state. On the other hand, if 
the outcome is "1" , the un-normalized Schmidt coeffi- 
cients of the resulting (successful) state |5's) are given by 
A™'^^! I ' ' ' I '^m'^ri ! '^"i-ri+i, A„i_ri+2, • • • , Aat. i.e., the 
first TO — ri-th Schmidt coefficients are all replaced by 
^m'^n- By construction jvPs) belongs to Case (a) of The- 
orem 2. Therefore, it always succeeds to give an TO-ME- 
state. Moreover, using Eq. ( ^ ) it has a norm 

(m - rOA^^^^'V^ + Xrn-r,+l + • • • + Ajv 
TO 

= — (Am-ri + 1 + A„i-ri+2 + ' ' ' + A^v ) 

n 



min^B™. 



(32) 



This proves that our explicit strategy saturates the bound 
and completes our proof for the case (b) of Theorem 2. 
QED. 

Recall that Theorems 1 and 2 combined together are 
equivalent to our Main Theorem (Theorem 0). Since we 
have by now proven both Theorems 1 and 2, our Main 
Theorem has been established. 



VI. THE LAW OF LARGE NUMBERS 

In this section, we derive some constraint on the prob- 
abilities of having large deviations from the average prop- 
erties. Consider the question raised in the abstract and 
the introduction: Can collective measurements defeat the 
law of large numbers? We now show that the answer is 
no. That is, suppose Alice and Bob share n pairs of 
particles, each pair in a state j^*) with an entropy of en- 
tanglement E{\'^)). We shall show in Theorem 3 below 
that the maximal probability of obtaining nK singlets, 
with K > E{\'^)), goes to zero as n goes to infinity. 

Once again, we want to emphasize that this result does 
not follow automatically from the fact that on average we 
cannot obtain more than nE singlets. Indeed, an average 
of nE singlets could conceivably be obtained if with a 
non-negligible probability p = E/K we get nK singlets 
while with probability I ~ E/K we get no singlets at all. 

Theorem 3: In the entanglement manipulation of n 
pairs 5", the optimal probability (over all possible strate- 
gies) of getting nK singlets, p^k'^ , tends to 1 (0 respec- 
tively) when K < i?(|*)) (K > i?(|*)) respectively) in 
the limit n oo. 

Remark: It can also be shown that, as a function of K , 
the jump from to 1 in the value of P2n'K'^ occurs in a 
region of width 0(n~^/^) around E{\^)). We shall skip 
the proof here. 

Proof of Theorem 3: That p^ii^ tends to 1 in the 
large n limit when K < E{\'^)) follows trivially from 
Bennett et a/.'s reversible strategy Q and Lemma 2. 
Let us now consider the case K > E{\'^)). Here we 
view the n pairs as a single pair in state ^' = 5"", 
by considering all n Alice's (Bob's) particles to form a 
single (more complex) quantum system. Similarly, the 
final nK singlet pairs can be viewed as a single pair in 
a 2"-'^-dimensionally maximally entangled state. Then 
the problem of extracting nK singlets from the n pairs 
^' can be rephrased as the problem of extracting an 2"^- 
dimensionally maximally entangled state from 5*. The 
maximal probability for success is p^x'^ which can be 
bounded by using Theorem 1. 

Let Ai's represent the Schmidt coefficients of '5; they 
are also the eigenvalues of Alice's reduced density matrix. 
Since Alice's reduced density matrix has a product form, 
(originating from the n pairs j^*)) its weight must be 
concentrated on a 'typical' space of dimension roughly 
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2"-^. [Here we simply our notation and use E to denote 
This is essentially the law of large numbers in 
classical probability theory. See also quantum noiseless 
source coding theorem Q .] Let us pick a Kq such that 
K > Kq > E. Since Kq > E, given any S > 0, for a 
sufficiently large n, we have that X]i=2"^o K < ^ where 
t is the number of terms in the Schmidt decomposition 
of jvl/). [An 'atypical' space has a small weight.] Let 
us apply theorem 1 to the case N — t", m — 2"^ and 
m— r+1 = 2"^". Notice that r/m > 1/2 for a sufficiently 
large n. Hence, pi^^^/2 < rp^^^^/m < EC™-r+i < 
S. Substituting m = 2"^ back, we get p^Ik^ ^ as 
n oo. QED. 

In fact, any particular strategy which transforms n 
copies of the state ^ into an average of nE singlets gives 
a singlet number probability distribution similar to that 
of reversible strategy . This follows immediately from 
the result in the next section. 



VII. SPECIAL STRATEGIES 

In the previous sections we were interested in the ques- 
tion of what is the maximal probability to transform an 
arbitrary entangled state ^ into a given maximally en- 
tangled state, say (where m is some given fixed di- 
mension). What happens to the original state 5* in those 
cases in which the transformation into is not suc- 
cessful was not important for us. We will now consider 
special manipulation strategies which are such that for 
every outcome the initial state is transformed into some 
maximally entangled state. (Note that, by extension of 
language, we denote direct product states as "maximally 
entangled states of dimension zero" ) . Such a strategy S 
can be characterized by the probabilities pq{S), pi{S), 
... with which the initial state ^' is transformed into 
$1,... respectively. 

A convenient way to describe this probability distri- 
bution is to use instead of the probabilities Pra{S) the 
"cumulative probability" p^*(5). 



MAX 



But why should the sum Pm{S) + Pm+i{S) 



(33) 



In the present section we find an upper bound on the 
cumulative property for an arbitrary strategy S. 



plniS)<p: 



MAX 



(34) 



where pf^^'^ is the supremum probability over all possi- 
ble strategies to convert \E' into an m-dimensional max- 
imally entangled state (an m-state). Since PmiS) rep- 
resents the probability to convert ^ into an m-state by 



using the particular strategy S while p. 



MAX 



represents 



the supremum probability (over all possible strategies) 
to convert ^ into an m-state, it is obvious that Pm {S) < 



p: 

be smaller than p^^'^'^- 

The reason is that, as we have shown Lemma 2, a 
maximally entangled state of dimension k can always 
be converted with certainty, into a maximally entangled 
state of smaller dimension m (m < k). Then, sup- 
pose that Alice and Bob, by using the strategy S con- 
vert \1/ into a maximally entangled state of dimension k 
larger than m. They can then convert, with certainty, 
this state into a maximally entangled state of dimen- 
sion equal to m. Consequently, by appending this reduc- 
tion strategy to the strategy S, we obtain a new strat- 
egy S' which converts 4" into an m-state with proba- 
bility Pm{S') = J2k>mPm{S) = P™*(5), (whilc having 
zero probability to convert 4" into maximally entangled 



MAX 



IS 



states of dimension larger than to). Now, as p. 
the supremum probability (over all possible strategies) 
of converting ^! into an m-state, we must have in partic- 
ular Pm^'^ > Pm[S') — p^°-^{S) which proves the bound 
in Eq. (H). 



VIII. 



NON-EXISTENCE OF UNIVERSAL 
STRATEGY 



As shown in Section VIL for any strategy S which 
transforms an arbitrary state ^' into different maximally 
entangled states $m, the cumulative probability of 
obtaining some maximally entangled state of dimension 
TO or larger is bounded by 



tot < MAX 



(35) 



We have also seen in the previous section that for any 
particular m there exists a strategy which saturates this 
bound (the strategy which yields with probability 
equal to p^^'^ ^-nd k > m with zero probability). 
The question is whether there exists a "universal" strat- 
egy 5""™ whose cumulative distribution saturates this 
bound for all m's. The reason we call such a strategy 
"universal" is that such a strategy, followed by the reduc- 
tion of some of the final maximally entangled states into 
maximally entangled states of lower dimension could gen- 
erate any possible distribution consistent with the bound 
( p5| ) . We shall show however that such a universal strat- 
egy does not exist. 

Proof: We show that a universal strategy generally 
cannot exist for the case = 3 and m = 2 or 3. Consider 



(36) 



I*) = VA7|11> + v^|22> + 



with p^'^^ = 1 and A2 + A3 - Ai > 0. Assume, by 
means of contradiction, that a universal strategy does 
exist. We shall use projection operators rather than pos- 
itive operator valued measures (POVMs) in our discus- 
sion. As noted in Sec. 2, there is no loss of generality. 
Let Pi, P2, ■ ■ ■ , Pr be the set of all projection operators 
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by Alice that give some 3-state in a particular univer- 
sal entanglement manipulation strategy. By definition, 

(A + -P2 H h i^r)!*) has a norm Note that 

it follows from Theorem 2 that p^^^ = 3 A3. Since 
pMAX = 1^ it is necessary for a universal strategy that 

the residual state = (1 - Fi - P2 

has p^^-^ = 1. But this requires the squared eigenval- 
ues of the reduced density matrix of to satisfy the 
constraint Aj + A3 — X'^ > 0. We shall show that this 
is generally impossible. The point of our argument is 
that, as shown by Lemma 6 below, the extraction of a 
3-state will lead to an equal decrease in all three squared 
eigenvalues (of the reduced density matrix of |^'r)). i.e., 
A'^ = Ai -pf ^^/3 = A, - A3. Therefore, unless Ai = A2, 
the residual state has Aj-t-Ag — A'^ A2 — Ai < 0, thus 
contradicting the requirement that P2^ H"^ r)) = 1- 

In the above proof, we have used the following Lemma. 

Lemma 6: Consider a state 



Ai|ll) + VA2|22) + v/A3|33) 



(37) 



in Schmidt decomposition. Any strategy that extracts 
a 3-ME-state with a probability p from will lead to 
an equal decrease in all three eigenvalues of the reduced 
density matrix of the un-normalized residual state, i.e. 
A- = Xi—p/3 where the A-'s are eigenvalues of the reduced 
density matrix of the un-normalized residual state. 
Proof of Lemma 6: See Appendix 

IX. MIXED STATES 

Let us now consider the case when Alice and Bob share 
a mixed initial state pini. Since pmi is impure, one gener- 
ally cannot write it in terms of Schmidt decomposition. 
More importantly, even if pini happens to be symmet- 
ric under the interchange of Alice and Bob, there is no 
guarantee that the intermediate states that they get dur- 
ing the entanglement manipulation process will respect 
such a symmetry Therefore, the symmetry argu- 

ment much emphasized in the earlier part of this paper 
will no longer be valid. Manipulations of a mixed state 
using two-way communications are generally more advan- 
tageous than a one-way strategy. Indeed, Bennett et al. 
have shown that one-way capacity and two-way capacity 
for purification are provably diS'erent . 

We also proved in Section || that, for a pure bipartite 
state, entanglement manipulation strategies with one- 
way communications are provably better than no com- 
munications. Notice that one-way communications is 
useful for an entanglement manipulation strategy that 
has a probability of success strictly between and 1 , but 
not for (deterministic) quantum error correction ||^ . The 
role of communications in entanglement manipulations 
deserves future investigations. 

For a mixed state, there are generally four distinct 
supremum probabilities to consider: pf^, p. 



and corresponding to entanglement manipulation 
schemes with two-way communications, one-way commu- 
nications from Alice to Bob, one-way communications 
from Bob to Alice and no communications respectively. 
While simple bounds on the success probability for ma- 
nipulating mixed states may be derived, many interesting 
questions remain unanswered. For example, we do not 
know the value of ^2"-^ the asymptotic limit n — > 00 
in the region -Do(p) < ^ < E{p) where -Do(p) is the 
entanglement of distillation (without any classical com- 
munications between Alice and Bob). 

To conclude, we expect the subtle interplay of the con- 
cepts of probability, classical communications, collective 
manipulations and symmetry in the case of mixed states 
to be even more challenging than the pure state case con- 
sidered in this paper. 



X. OPEN QUESTIONS ON PURE STATES 

Even for the case of a pure initial state, many interest- 
ing questions remain unsolved. For instance, what is the 
supremum probability p'^ of getting an m-ME-state with- 
out any classical communications? Notice that Bennett 
et al.'s reversible strategy Q (but not the local filter- 
ing strategy [^) is an example of a strategy which does 
not require any classical communications. It is an open 
question whether one can do better than Bennett et al.^s 
strategy without any classical communications. 

We emphasize that the symmetry that we have found 
here applies not only to entanglement concentration, but 
also to all types of entanglement manipulations including 
entanglement dilution For instance, the usual proce- 
dure of entanglement dilution via teleportation falls in- 
side our general framework of using a single generalized 
measurement by Alice followed by one-way communica- 
tions of its result to Bob and a subsequent unitary trans- 
formation by Bob. A more systematic investigation of 
our formalism in applications beside entanglement con- 
centration may prove rewarding. 
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APPENDIX A: PROOF OF PROPOSITION 1 

Let us write 5* in its Schmidt decomposition: 

l*> =I]0^|afc)IM- (Al) 

A; 

Consider any of Bob's projection operator 

hi 

After the projection, the state he shared with Alice be- 
comes 



(A3) 



i,k 



On the other hand, if, instead of Bob, Alice performs a 
measurement using the corresponding operator defined 

by 



(A4) 



an outcome I will give the state 



P, 



Alice 



(A5) 



i,k 



Let us consider unitary transformations U {\ai) — > 
I]pWip|ap)) and V (|6fc) J2q^kq\bq)) that will put 
in Schmidt decomposition, i.e.. 



iu®v)\^^)^J2VT^K)\bp) 



(A6) 



From the definitions of U and V and Eqs. (A5) and (AC), 
we find that 



(A7) 



ik 



Now consider {V (g) U) \ ^^). 

iy ®U)\^^) ^YH^^'^^^kVkqU^p\aq)\hp) 
ik pq 



pq 



(A8) 



where Eq. ( [A7|) is used in the second equality. 
From Eqs. ([A6D and (^), we find that 



{I®PP°^)\^)^{U{'®UP) {Pi 



Ali. 



(A9) 



where Uf" = V-^U and UP = U-^V. QED. 



APPENDIX B: PROOF OF THE NECESSITY OF 
ONE-WAY COMMUNICATION IN 
ENTANGLEMENT MANIPULATIONS OF 
BIPARTITE PURE STATES 

Definitions 2 and 3 in the main text are needed for this 
proof. The basic reason for the necessity of classical com- 
munication is that, whenever p™"^ as defined in the text 
is strictly less than 1, Bob generally needs Alice's help 
to figure out whether the entanglement manipulation is 
successful or not. 

Consider the example of \'i>) — a\ll) + b\22) where 
a > 6 > 0. We shall first argue that the supremum prob- 
ability of obtaining a sing let satisfies < pf^^ < 1: 
Since the local filtering strategy in Ref. gives a non- 
zero probability of getting a singlet, we have p^^^ > 
pk>cai fiitciing ^ Morcover, since the entanglement 
< 1 and the average entanglement cannot increase 
upon entanglement manipulations, the supremum proba- 
bility p^^-^ of getting a singlet out of entanglement ma- 
nipulations is less than 1. 

Now consider any strategy that gives < P2 < 1. Let 
us divide up its outcomes into two classes: {si , S2, ■ ■ ■ , Sp} 
(success) and {/i, /2, • ■ ■ , fq} (failure) and denote the un- 
normalized reduced density matrix of Bob for an outcome 
Si {fj) by pf°'' {Pf°''). Since < p2 < 1, Bob needs to de- 
termine the outcome of the entanglement manipulation 
by distinguishing with certainty between the two den- 
sity matrices p^t,,, = Pf"' and pf = pf"'- 



rjBob 

success 

Now the distinguishability of two density matrices can be 

Bob „Bob 



described by the fidelity |22| F{t 



The 



detailed definition and properties of the fidelity are irrel- 
evant for our discussion. It suffices to note the following 
fact: In order to show that it is impossible for Bob to 
distinguish with certainty between the two density ma- 
trices without communications from Alice, all we need to 

Bob pBob 

prove is that f ( ^°|£r°^ , T^mrf^) ^ or equivalently 

the supports of pf„° ^g^, and pf°,\^^^ are not orthogonal 
to each other. The proof of this claim is simple: Ow- 
ing to causality, the density matrix of Bob is conserved 
throughout Alice's measurement, i.e., 



Bob 
r success 



r J allure 



Bob 

r initial 



a2|l)(l|+62|2)(2|. 



(Bl) 
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Since pgft^^i has a two-dimensional support, p^°^^^,, 
must have a support of at most two dimensions. On 
the other hand, as pf °^ is the reduced density matrix for 
a singlet, pfuness^ being the sum of pf°^'s, must have 
a support of at least two dimensions. Combining these 
two statements, pfuccess ^ support of exactly two di- 
mensions. Now that both and p^°ccess have two- 
dimensional supports, the support of pf^uure ™.ust be a 
subspace of the support of pfuccess- Therefore, we con- 
clude that pf„° cess and pf°u^re do not have orthogonal 

Bob pBob 

supports and hence the fidelity F{ J'^^^Si^^ , ^-/a.^^re j _^ 

^''P success P failure 

0. QED 



APPENDIX C: SOME DETAILS OF PROOF OF 
LEMMA 2 

As before Alice attaches an ancilla to her system A 
and the evolution needed now is 



|0)a|i). 



k 



(Cl) 



That is, the state \ j)A of the particle remains unchanged, 
but the ancilla is brought to an equal superposition of 
all states |l)a, • • • , \k)a, with the exception of \j)a- The 
evolution of the state of the ancilla and the pair can, 
therefore, be summarized as 

k 

- itt/U^, t u}M.l (C2) 

i.e., each state \i)a of the ancilla is correlated with a 
different k-1 dimensional maximally entangled state. 

Next, Alice measures the state of her ancilla. No mat- 
ter what result she obtains, the pair of particles is left in 
a (k-l)-dimensional maximally entangled state. Which 
particular state is obtained will depend on Alice's re- 
sult. Suppose Alice finds the ancilla in the state \io)a- 
Then the pair is in the state -^^Y,''.^^,^_^^^ \j)A\j)B- 
If they wish, Alice and Bob can now convert this state 
into the standard (k-l)-dimensional maximally entangled 
state 5I]j=i 1.7 ) A 1,7) B- This can be realized by Alice 

communicating to Bob the result of her measurement, 
such that both of them know which (k-l)-dimensional 
maximally entangled state has been obtained and then 
having both of them perform appropriate local unitary 
transformations of their particles. 

Now starting with a maximally entangled r- 
dimensional state, one can repeat our argument to re- 
duce it to a maximally entangled (r — l)-dimensional 



state, (r — 2)-dimensional state, etc until we obtain an 
s-dimensional state. This shows that any maximally en- 
tangled state can be reduced to one with a lower dimen- 
sion. 



APPENDIX D: PROOF OF LEMMA 3 

Since l^'^e^''^) is an m-ME-state, all we need to show is 

the reduction with certainty from |*^rc^'''?) to l^^c'''''"^) 
whenever q > 1. The proof here is analogous to that of 
Lemma 2. 

Suppose Alice attaches an ancilla to her system and 
evolves them in the following manner: 

|0)„bV-(^^EN)a)|j)A, 

for I < j < m — p 

1 P+Q 

i=Ui¥=j-{m-p) 

{oim — p+l<j<m + q. (Dl) 



In words, the ancilla is brought to an equal superposition 
of all states |l)a) • " " ; |p + 9)a if the state of Alice's system 
is where 1 < j < m — p. However, when Alice's 
system is in where m — p + 1 < j < m + q, the 
ancilla is brought to an equal superposition of all states 
|l)a, ■■ ■ Ap + l)a with the exception of \j - (m - p))a. 
Upon measuring the state of the ancilla and applying 
local unitary transformations to their respective systems, 
Alice and Bob end up in a new precursor This 
proves the reduction from to By 



repeating this reduction process, one can, with certainty, 
reach '°) which is an m-ME-state. 



APPENDIX E: PROOF OF LEMMA 4 



It is convenient here to define Sm-r+i = J2iLm-r+i ^i- 
Then, 



m 



r/m ^ rim 

m „ 



1 r 



^m—r+l 



Now, 



B, 



r+2 



r + 1' ' ' ' r 

rSjn-r+l + rXm-r > {r + l)Sm-r+l 

(El) 



" ['5'm— r+1 ~\~ ^m—r H" -^m— r — l] 



(r + 2)' 

> TV [Sm-r+1 + 2Am-r] 



(r + 2) ' 



{r + 2){r + l) 



[(r + l)5„_^+i + 2(r + 1)A„_^] 
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m 



-[{r + l)S, 



m— r+1 



+ r\„i-r + {r + 2)A,„_r] 



i=l 



(G3) 



(r + 2)(r + l)' 

777- 

> . „w , -|J (^ + l)S'm~r+l + Sm-r+1 + [r + 2)Xm-r] 

(r + Z)(r + i) Equating this with the partial trace of P|^')(\E'|P over 

= -Ur + 2)Sm^r+i + (r + 2)Xm-r] Ha, we find that the ^^^P\i)'s form an orthonormal set. 

(r + 2)(r + l) V" 

m _ . The residual state 



(r + 1) 



[Srn—r+1 + -^m— r] 



where Eq. (El) is used in obtaining the fifth hne. QED. 



APPENDIX F: PROOF OF LEMMA 5 

Let us consider the list of values of B"^ , B™ , • • • , -B™ . 
Since 1 > miUr-B™, as a function of r, _B™ must 

start to increase at some point, i.e., there exists ro such 
that i3™+i > 



But then, by Lemma 4, i 2 > 



-B™+3 > -B™^_2, etc. In words, once starts to 
increase, it wih continue to do so. Let us focus on the last 
minimal point of the function _B™. i.e., the largest value 
ri such that B^.^ = min^S;:'. By definition, > 
which, from Lemma 4, implies that _B™ < < ■ • ■ < 

— 1. This completes the first part of the proof. 

Moreover, we claim that B^" > B'^ > ■ ■ ■ > B™. We 
prove this by contradiction. Assuming the contrary, there 
exists an a < ri such that B^_i < B™. Then Lemma 4 
implies that < i?™, which is impossible because it 

contradicts the fact that _B™ — min^-B™. 

Combining the results of the above two paragraphs, we 
conclude that B^" > ^2 '>•••> 5™ < < • • • < 

B,™ = 1. QED. 



(E2) 



(l-P)!*)^ jAi--|l"l) 



+ ,/A2--|2"2) 



|3"3>. 



(G4) 



Notice that the |«")'s are orthonormal because 

(j|(l-P)(l-P)|.> 
= (j|(l-2P + PP)|z) 
= (j|(l-2PP + PP)|z) 
= (j|(l-PP)|z) 
= 0. 



(G5) 



Here the last equality follows from the fact that P|z)'s are 
orthogonal to one another. This shows that an extraction 
of a 3-ME-state of probability a leads to a decrease of 
each A's by a/3. The same argument can be applied to 
each of P = Pi, P2, ■ ■ ■ , Pr- This shows that after the 
extraction with a probability p of a 3-ME-state from ^, 
the eigenvalues of the reduced density matrix of the un- 
normalized residual state satisfy A ■ = Ai — p/3. QED. 



APPENDIX G: PROOF OF LEMMA 6 



The following proves the claim in Lemma 6 that X[ = 
Ai —p/3. For simplicity, we shall use projection operators 
rather than (POVMs). As noted in Section ||, there is 
no loss in generality. Let Pi , P2 , • • • , P^ be the set of 
projection operators for extracting some 3-ME-state from 

Now suppose P gives a 3-ME-state with a probability 



with 



I*) =P|*)-(-(l-P)|*) 

PI*) = (v/aTp|i)) |1) 



(01) 



A0PI2) 12 



A3PI3) |3). 



(G2) 



Since P\'^) is 3-ME-state with a norm a, its reduced 
density matrix for B, 



[1] 
[2] 



[3] 

[4] 
[5] 

[6] 
[7] 
[8] 

[9] 
[10] 



A. Einstein, B. Podolsky, and N. Rosen, Phys. Rev. 47, 
777 (1935). 

J. S. Bell, Physics 1, 195 (1964); J. F. Clauser, M. A. 
Horne, A. Shimony, and R. A. Holt, Phys. Rev. Lett. 23, 
880 (1969). These two papers can also be found in J. A. 
Wheeler and W. H. Zurek, Quantum Theory and Mea- 
surement (Princeton University Press, Princeton, 1983), 
p. 403 and p. 409 respectively. 

C. H. Bennett and S. J. Wiesner, Phys. Rev. Lett. 69, 
2881 (1992). 

C. H. Bennett et ai, Phys. Rev. Lett. 70, 1895 (1993). 
H. Buhrman, R. Cleve, and W. van Dam, Los Alamos 



e- print archive quant- ph/9705033. 

C. H. Bennett et ai, Phys. Rev. A53, 2046 (1996). 

C. H. Bennett et al, Phys. Rev. A54, 3824 (1996). 

S. Popescu and D. Rohrlich, Los Alamos preprint archive 

quant-ph/9610044 

See, for example, the Appendix of L. P. Hughston, R. 
Jozsa and W. K. Wootters, Phys. Lett. A183, 14 (1993). 
A more precise discussion goes as follows. Suppose the 
goal is to transform an initial state ^ into Ti. By keep- 
ing track of the results of all her measurement, Alice will 



15 



[11 



[12] 



[13] 
[14] 

[15] 
[16] 
[17] 



generally obtain the state of a larger system T — Ti®T2. 
However, she can easily reduce its Schmidt coefficients to 
those of Ti simply by performing a complete measure- 
ment along the second system. 

This symmetry is originally defined only on the subspace 
of the Hilbert space with Xk 0, but it can be trivially 
extended to the whole Hilbert space by pairing, in the 
subspace where — 0, any orthonormal basis |a5,')'s of 
Ha with any orthonormal basis \b'k)'s of Hb- 
This interchange symmetry is reminiscent of the sym- 
metry in two-party cryptographic protocols discussed by, 
for example, J. Kilian, in Proceedings of the 20 th An- 
nual Symposium on the Theory of Computing, (ACM, 
New York, 1988), p. 20. The potential relevance of this 
interchange symmetry in quantum two-party protocols 
has been speculated by Mayers jl^ in the discussion of 
the impossibility of unconditionally secure quantum bit 
commitment llsMlq]. 

D. Mayers, Phys. Rev. Lett. 78, 3414 (1997). 

H.-K. Lo and H. F. Chau, Phys. Rev. Lett. 78, 3410 

(1997). 

H.-K. Lo and H. F. Ch au, PhysicaD 120, 177 (1998). 



[18] 
[19] 



D. Mayers, Report No. juant-ph/9603015 



This equivalence (or invariance) between the outcomes of 
Ahce and Bob's local experiments is easy to understand 
in the case where Alice and Bob share no initial entangle- 
ment. In this case, consider, for instance. Bob prepares 
a spin-1/2 object in his own laboratory along the z-axis 
and then measures its spin along the x-axis. The outcome 
of this simple experiment is, of course, equally probable 
to be up or down. Such an experiment by Bob can be 
mapped into an experiment by Alice in which she pre- 
pares a spin-1/2 object in her own laboratory along the 
z-axis and then measures its spin along the x-axis. Just 
Uke Bob's experiment, Alice's experiment also gives equi- 
probable outcomes. In this sense, the two experiments are 
equivalent. On the contrary, suppose Alice, but not Bob, 
share some initial entanglement with Charles. Alice can 
then teleport states to and from with Charles whereas 
Bob cannot. It is then clear that Alice's local experiments 
(plus classical communications) are not generally equiv- 
alent to that of Bob. In conclusion, entanglement with 
third party generally destroys equivalence of local exper- 
iments between two observers. In this paper, we show, 
however, that two persons, Alice and Bob, sharing a pure 
entangled initial state still respect the equivalence in lo- 
cal experiments. This observation, which greatly simpli- 
fies our analysis, is not a priori obvious. Note that this 
equivalence is used here to prove that two-way commu- 
nications can be reduced to one-way communications in 
the context of entanglement manipulations of a pure en- 
tangled state. Curiously, another equivalence (symmetry) 
argument has previously been used to prove that two-way 
communications is provably better than one-way commu- 
nications in entanglement purification of mixed states Q. 
In our opinion, the power of symmetry arguments in en- 
tanglement manipulations remains to be fully explored. 
We use the superscript MAX because, as will be shown 
in Section 6, the supremum probability is attainable by 
the optimal strategy. 

This lemma has also been proven by other groups such as 



by C. H. Bennett and J. Smolin (private communications) 
and by M. Nielson (private communications). We thank 
them for helpful discussions. 

[20] B. Schumacher, Phys. Rev. A51, 2738 (1995); R. Jozsa 
and B. Schumacher, J. Modern Optics 41, 2343 (1994). 

[21] It is an interesting open question whether there exists 
any mixed state that respects an interchange symmetry 
between Alice and Bob for all strategies of entanglement 
manipulations. We thank M. A. Nielson for raising this 
question. 

[22] R. Jozsa, J. Modern Optics 41, 2315 (1994). 



16 



